# Visual-Text Conversion
Uae License Detection
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder to process document images
Image-to-Text
Transformers

U
codedrainer
21
2
Donut Proto
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and text decoder for image-to-text conversion
Image-to-Text
Transformers

D
naver-clova-ix
30
7
Poster2plot
This is an image captioning model that generates plot descriptions from movie/TV show posters. It produces decent plot summaries, though far from perfect. We are continuously improving the model.
Image-to-Text
Transformers English

P
deepklarity
15
4
Featured Recommended AI Models